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Abstract 

In 1984, Kurt Mahler posed the following fundamental question: 
How well can irrationals in the Cantor set be approximated by ra- 
tionals in the Cantor set? Towards development of such a theory, 
we prove a Dirichlet-type theorem for this intrinsic Diophantine ap- 
proximation on Cantor-like sets, and discuss related possible theorems 
and conjectures. The resulting approximation function is analogous 
to that for M'^, but with d being the Hausdorff dimension of the set, 
and logarithmic dependence on the denominator instead. 

1 Introduction 

The Diophantine approximation theory of the real line is classical, extensive, 
and essentially complete as far as characterizing how well real numbers can 
be approximated by rationals {\10\ is a standard reference). The basic result 
on approximability of all reals is 

Theorem 1.1 (Dirichlet's Approximation Theorem). For each x G M, and 
for any Q gN, there exist | G Q with I < q < Q , such that 

\x < 



Corollary 1.2. For every x G 



\x < ^ for infinitely many - G Q. (1) 

' q' q 
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That this rate cannot be improved significantly for all reals is evidenced 
by the existence of badly-approximable numbers (see definition 13. ip . Fur- 
thermore, the rate cannot be improved for most irrationals, in the sense that 
the set of very well approximable numbers (see definition I3.2p is null. In ad- 
dition, the subject of approximating points on fractals by rationals has been 
extensively studied in recent years; see for example [2], [S], and [7] for badly 
approximable numbers and [5] and [TT] for very well approximable numbers. 

In 1984, K. Mahler published a paper entitled "Some suggestions for 
further research" [9j, in which he writes: "At the age of 80 I cannot expect 
to do much more mathematics. I may however, state a number of questions 
where perhaps further research might lead to interesting results." One of 
these questions was regarding intrinsic and extrinsic approximation on the 
Cantor set. In Mahler's words, "how close can irrational elements of Cantor's 
set be approximated by rational numbers 

1. In Cantor's set, and 

2. By rational numbers not in Cantor's set?" 

As far as the authors are aware, there has been very little research on 
such intrinsic Diophantine approximation on fractals, though there is such 
for algebraic varieties (see for example [4]). It is also worth noting that 
while addressing non-intrinsic questions, the authors of [8j used intrinsic 
techniques. Studying the intersection of C with numbers approximable by 
rationals | with q of the specific form 3", they obtain lower bounds for gen- 
eral, non-intrinsic approximation. They point out that this type of rational 
approximation to a point of C will necessarily lie in C. Thus some of their 
results are de facto also results giving lower bounds on approximations on 
C by any rationals in C. However, these results are not likely to be optimal 
since only Cantor rationals with denominators of the form 3" are considered, 
so we wish to consider intrinsic approximation as a topic in and of itself. We 
present some initial steps towards a theory determining the analogous opti- 
mal rate of intrinsic approximation for the Cantor set (and those constructed 
similarly) . 

The paper is organized as follows. In the following section we present our 
main result. Section 3 is devoted to formulating our conjecture regarding the 
distribution of rationals on the Cantor set, which if true would allow us to 
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conclude that our main theorem is optimal. The last section is devoted to 
extrinsic approximation on the Cantor set, i.e., approximating irrationals on 
the Cantor by rationals not on the Cantor set . 

2 Dirichlet type theorem for Cantor sets 

Let C denote the Cantor-like set consisting of numbers in J = [0, 1] which 
can be written in base b > 2 using only the digits in S* C {0, 1, . . . , 6 — 1}, 
where \S\ = a > 1. Obviously this is equivalent to partitioning / into b equal 
sub intervals, only keeping those indexed by S, and successively continuing 
this on each remaining subinterval. (The usual middle-thirds set is given by 
b = 3, S = {0, 2}.) We will assume that 5 is a proper subset of {0, 1, . . . , 6 — 
1}, i.e. that C 7^ /, as the case C = / is simply the classical one. 

Throughout the paper we denote by {x} the fractional part of x and by 
[x\ the integer part. To state the theorem we also associate to the set the 
number bo, where 60 is the least integer greater than 1 such that C is invariant 
under x {box}. As we shall see, this is simply equal to b except when the 
given definition of C is redundant in a certain sense, in which case we still 
have b = Uq for some r G N. 

Theorem 2.1 (Dirichlet for Cantor sets). Let d = dimC. For every x & C 
and every Q of the form 6", there exists p/g G Q fl C with q < b^ such that 

q' qQ 

It follows from the statement that for any Q G N, the same statement 
holds with the bound multiplied by b (by letting < Q < 6"+^). As an 
immediate corollary to Theorem 12.11 we get: 

Corollary 2.2. For all a; G C, there exist infinitely many solutions p G N, 
g G N, £ G C to 

\x — — *\ . 

r ^(logfeog)^/'^' 

Also notice that the approximation function's asymptotic behavior gets 
better as ci — )■ 0, even though the first such q whose existence we prove can 
tend to infinity as b does. 

Before we begin the proof, we first need a characterization of the rationals 
in C: 
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Lemma 2.3. A rational number is in C if and only if it can be written either 
as a terminating base-b expansion (left end points in the construction) or as 

where l,k and Ci G S. 

Equivalently ^ can also be expressed in terms of base-b expansions, i.e., 

{{coCi...Ck+l-l)b - {coCi...Ck-l)b) . V 

Proof. A rational in C has either a terminating 6-ary expansion (consisting 
of digits from S) or an eventually periodic one. If it is purely periodic of 
period /, it has the form 

b^~^Xo + b^'^xi + ... + xi-i U~^Xq + b^-'^xi + ... + xi^i 



n=l 



{uy u - 1 



where the digits Xi are in S. If the rational is not purely periodic then 
one must insert some number k of initial zeros and then add the initial 
terminating expansion of length fc, so we obtain the form 

U-^xq + U-^xi + ... + xi-i ^ - l){b''-^yo + b^-^yi + ... + yk-i) 



{U - l)b^ {U - l)b^ 

for some digits yi & S. Rearranging this gives the result. □ 

Proof of theorem \2.1\ Let a; G C \ Q. Given any n G N, let Q = 6". Denote 
by M = (mj) the semigroup of positive integer multiplication maps mod 1 
leaving C invariant. We order the elements in increasing order from 1 = mo- 
There are a" possibilities for the first n digits in the 6-ary expansion of {qx}. 
Consider the elements of C given by 0, {mQx},...{man-ix}. By the pigeonhole 
principle either there exist < g, g' < a" such that {mqx} and {niqix} have 
the same first n digits, or the same holds for some < g < a" and 0. That is, 
they are in the same interval of the n-th stage of C"s construction. Assuming 
the former, it follows that |{mg/x} — {m^x}! < ^. Rewriting gives 

\mqix — \ rnqix\ — nigX + \ rnqx\ | < — . 
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Setting p = [mqix\ — [m^xj we get 

I Pi 1 

X — <^ . 

m'^-mq^ {m'q-mq)Q' 

If one of the above values is rather than an rrii the calculation is triv- 
ial. Suppose M were generated by more than one element. Then by H. 
Furstenberg's seminal result in [3], stating that the only infinite closed sub- 
set of M/Z invariant under M is M/Z itself, we would have to have C = I. 
But we assumed C 7^ /, so that cannot be true. (Notice however that 
we recover Dirichlet's original theorem in this case, since then M = N, so 
uiqi — niq < aJ^ = Q.) So M has a single generator, which we will denote 
60, and then b = for some r, and a = Oq. C is also the set constructed 
in the corresponding way for bo and some 5*0 (namely, the unique subset of 
{0, 1, . . . , 60 ~ 1} such that 5* is the set Sq of r-fold concatenations of ele- 
ments of So). Then man_i = 69 Since niq' — niq = 69(^0 ~ ^) some 
integers k, d, following ([3]) it suffices to observe that p = [fog^'^xj — [bQx\ . 

Thus — 2 — e C, and m„/ - m„ < bf = bf . □ 

It is worth noticing a similarity in form between Corollary 12.21 and the 
more general form of Dirichlet's theorem, stating that points in are ap- 
proximable according to the function ^^17? • However there is a "gap" between 

the approximation functions in these two theorems - the q^^^ from the clas- 
sic case is replaced by a qY^'^ in the fractal case, so that the intrinsic 
approximation function on C does not tend to even as dimC tends to 
1. The reason for this dichotomy is evident from the proof, which hinges on 
the growth of in terms of k: When M has two generators, mk = k, corre- 
sponding to the classical case, and when M has a single generator, = 6q, 
corresponding to the fractal case. These two possibilities for the asymptotics 
of TUk - either exponential or linear - lead to the corresponding differences 
in the asymptotics of the approximation functions. 

3 Further investigation, conjectures and con- 
sequences 

The obvious next step would be to check whether this is the optimal rate of 
approximation for C. We first recall the following definitions: 
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Definition 3.1. The set of badly approximable numbers (BA) consists of all 
reals x such that for some c{x) > 

I J) ci X] 

--x >-V- forallp/qeQ. 
\q q2 

We thus define, relative to the approximation function proven above, the 
intrinsic set of badly approximable numbers, denoted by BAc, as those in 
C satisfying 

\--x\> ^^^\,,, for all p/g e Q n C. 

Definition 3.2. The set of very well approximable numbers fVWA^ is the 
set of all reals x satisfying for some positive e{x) 

x\ < 



• q ■ g2+6(a;) 

for infinitely many rationals p/q. 

We thus define the intrinsic VWA numbers, denoted VWAg, as those 
in C satisfying 

p 1 

\ - -x\< ^ . w rYji for infinitely many p/q e C, 

q gi+<^)(log(,gg)V<=' 

or equivalently, since e{x) > is arbitrary and can depend on x, as those in 
C satisfying 

p 1 

I ^1 < i+e{x) infinitely many p/q E Qf] C. (4) 

One could definitively say that our approximation function is optimal if one 
proved that intrinsic BAc is nonempty, and one could conclude that the 
power of q in the denominator is correct if VWA^ has zero d-dimensional 
Hausdorff measure. 

These results appear difficult to achieve, however, without having a deeper 
understanding of the rationals in C. For the real fine, there exists a large 
arsenal of useful information regarding distribution properties of rationals, 
their quantity within bounds on q, etc., whereas for C nothing is even known 
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about which denominators can appear (in reduced form, of course; the ex- 
pression ([3]) is not useful here since it is not reduced). In fact Mahler points 
out basically the same fundamental difficulty. To obtain the desired results, 
we believe a major new piece of information will be necessary, such as a 
sharp bound either on how many rationals appear in C with (reduced) de- 
nominators in a given range or on how these rationals "repel" each other as a 
function of q. We nonetheless make the following conjecture for the standard 
Cantor set supported by computer calculations (see table 1 below). 

Conjecture 3.3. Let N{s,t) = ||{| G C,gcd{p,q) = I, s < q < t}\\. Then 
A^(3",3"+i) G 0(2(1+^)") for all e > 0. 

1 A^(3" 3"+^) 

Notice that defining (p^n) = 2 ]\j(^^n-i ^ny conjecture is equivalent to 

the convergence of 0(n) to 1. The table below sums up our computer based 
results: 



Table 



n 


4 


5 


6 


7 


8 


9 


10 


11 


0(n) 


1.808 


1.064 


1.32 


1.18 


1.258 


1.057 


1.176 


1.063 



The conjecture is significant, implying that the power of q in our theorem 
cannot be improved. 



Corollary 3.4 (of conjecture ISTSb . /i(VWAc) 

dimensional H aus dor ff measure restricted to C. 

Proof. Let K denote the subset of VWAc satisfying 
e(x) = e, and observe that K is the lim sup of the sets 



Sn= U 

p/geC,3"<g<3"+i 



1 p 



1 



qi+e q 



0, where ^ is the d- 
for a particular 

(5) 



By the Borel-Cantelh lemma, niVe) = if Yl^=ol^i.^n) < oo. The number 
of intervals in the union Sn is by assumption in 0(2^^"'"^''") for all e' > 0. 

and fj, satisfies a rf-power law, so we 



The radius of each interval is 0( 



3(i+«)i 
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have that fi{B{x,r)) is 0(r'^) for x E C and small r. So the measure of each 
interval is 0( ^(i+,)„ ), and 

/^(^n) G O (^^(Tijr^J = 0(2(^'-)") for all e' > 0. (6) 

Thus we take e' < e, and obtain that J2'^=o^^(^n) < c>o as desired. Finally 
observe VWA^ = U^^^Vi/m, so /i(VWAc) = 0. 

□ 

At this point the reader might be thinking that a stronger bound on 
would allow one to prove that theorem 12.11 is even sharper: specifically, 

if A^(3", S""*"^) G 0(2"), then a proof of similar type would show that the 

superset 

\x : x\ < — ,^ , , , , , for infinitely many p/q G C} D VWA^ 

is yU-null. However this avenue is not possible, as a forthcoming followup 
paper by Noam Solomon and Barak Weiss discussing Conjecture 13.31 and 
partial progress towards it will prove that this stronger bound on does not 
hold. 



4 Extrinsic approximation 

As mentioned in the introduction, Mahler also raised the complementary 
extrinsic question: how well can a rational in C be approximated by rationals 
outside of C? To this end, consider a weaker variant of Conjecture 13.31 

Conjecture 4.1. A(3",3"+^) G 0(2^") for some 7 < 2. 

Note that a confirmation of this conjecture (for which our calculations are 
quite strong evidence) would be sufficient to show, by an argument similar 
to the one above, that the set 

{x E C : a; < — for infinitely many - G C| (7) 

I- ' g ' q 

is /i-nuU. In turn, consider the set of x G C such that 

P iP I ^ 

For all e > 0, there exist infinitely many - ^ C satisfying — x\ < — . (8) 

q ' q ' q^ 
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We denote this extrinsic analogue of WA, the complement of BA, by WAc= 
and its complement by BAc^. Thus (JTj), combined with Corollary 1.10 of [1] 
stating that BA fl C is /i-null, show the following: 

Corollary 4.2 (of conjecture 14.11) . WAc-c is of fx-full measure. 

We note that the sharpness in either direction of this statement is already 
known by [B] and 
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